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■ We review the history of the parallel tempering simulation method. From its origins in data analysis, 
>• . the parallel tempering method has become a standard workhorse of physiochemical simulations. We 

discuss the theory behind the method and its various generalizations. We mention a selected set of 
7-h ! the many applications that have become possible with the introduction of parallel tempering and we 

■ suggest several promising avenues for future research. 

in ; 
o . 

1 Introduction 

o : 

££J The origins of the parallel tempering, or replica exchange, simulation technique can be traced to a 
^ ] 1986 paper by Swendsen and Wang. 1 In this paper, a method of replica Monte Carlo was introduced 

P" 1 ! in which replicas of a system of interest are simulated at a series of temperatures. Replicas at adjacent 
m ^ ■ temperatures undergo a partial exchange of configuration information. The more familiar form of 
^ ■ parallel tempering with complete exchange of configuration information was formulated by Geyer 

^ ! in 1991. 2 Initially, applications of the new method were limited to problems in statistical physics. 
However, following Hansmann's use of the method in Monte Carlo simulations of a biomolecule, 3 
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Falcioni and Deem's use of parallel tempering for X-ray structure determination, 4 and Okamoto and 
co-worker's formulation of a molecular dynamics version of parallel tempering, 5 the use of parallel 
tempering in fields spanning physics, chemistry, biology, engineering and materials science rapidly 
increased. 

The general idea of parallel tempering is to simulate M replicas of the original system of interest, 
each replica typically in the canonical ensemble, and usually each replica at a different temperature. 
The high temperature systems are generally able to sample large volumes of phase space, whereas 
low temperature systems, whilst having precise sampling in a local region of phase space, may be- 
come trapped in local energy minima during the timescale of a typical computer simulation. Parallel 
tempering achieves good sampling by allowing the systems at different temperatures to exchange 
complete configurations. Thus, the inclusion of higher temperature systems ensures that the lower 
temperature systems can access a representative set of low-temperature regions of phase space. This 
concept is illustrated in Figure 1 . 

Simulation of M replicas, rather than one, requires on the order of M times more computational effort. 
This 'extra expense' of parallel tempering is one of the reasons for the initially slow adoption of the 
method. Eventually, it became clear that a parallel tempering simulation is more than l/M times 
more efficient than a standard, single-temperature Monte Carlo simulation. This increased efficiency 
derives from allowing the lower temperature systems to sample regions of phase space that they 
would not have been able to access had regular sampling been conducted for a single-temperature 
simulation that was M times as long. While not essential to the method, it is also the case that 
parallel tempering can make efficient use of large CPU clusters, where different replicas can be run 
in parallel. An additional benefit of the parallel tempering method is the generation of results for a 
range of temperatures, which may also be of interest to the investigator. It is now widely appreciated 
that parallel tempering is a useful and powerful computational method. 

One of the debated issues in parallel tempering regards the details of the exchange, or swapping, of 
configurations between replicas. Pertinent questions include how many different replicas and at what 
temperatures to use, and how frequently swaps should be attempted, and the relative computational 
effort to expend on the different replicas. Another emerging issue is how to swap only part of the 
system, so as to overcome the growth as \CR of the number replicas required to simulate a system of 
size N. We address these points of controversy in this review. 

The widespread use of parallel tempering in the simulation field has led to the emergence of a number 
of new issues. It has also become clear that temperature may not always be the best parameter to 
temper, and parallel tempering can be conducted with order parameters other than temperature, such 
as pair potentials or chemical potentials. Of interest is how to choose the order parameter whose 
swapping will give the most efficient equilibration. It has become clear that multi-dimensional parallel 
tempering is possible. That is, swapping between a number of parameters in the same simulation, in 
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a multi-dimensional space of order parameters, is feasible and sometimes advised. The improvement 
in sampling resulting from the use of parallel tempering has revealed deficiencies in some of the 
most popular force fields used for atomistic simulations, and it would seem that the use of parallel 
tempering will be essential in tests of new and improved force fields. 

Parallel tempering can be combined with most other simulation methods, as the exchanges, if done 
correctly, maintain the detailed balance or balance condition of the underlying simulation. Thus, there 
is almost an unlimited scope for the utilization of the method in computer simulation. This leads to 
intriguing possibilities, such as combining parallel tempering with quantum methods. 



2.1 Theory of Monte Carlo Parallel Tempering. 

In a typical parallel tempering simulation we have M replicas, each in the canonical ensemble, and 
each at a different temperature, T { . In general Ti < T 2 < ... < T M , and 7\ is normally the temperature 
of the system of interest. Since the replicas do not interact energetically, the partition function of this 
larger ensemble is given by 



where g« = n^Li(27rm., /cbTj) 3 / 2 comes from integrating out the momenta, rrij is the mass of atom j, 
rf specifies the positions of the N particles in system i, f3i = 1/(/cb^) is the reciprocal temperature, 
and U is the potential energy, or the part of the Hamiltonian that does not involve the momenta. If 
the probability of performing a swap move is equal for all conditions, exchanges between ensembles 
% and j are accepted with the probability 



Swaps are normally attempted between systems with adjacent temperatures, j — % + 1. 

Parallel tempering is an exact method in statistical mechanics, in that it satisfies the detailed balance 
or balance condition, 6 depending on the implementation. This is an important advantage of paral- 
lel tempering over simulated annealing, as ensemble averages cannot be defined in the latter method. 
Parallel tempering is complementary to any set of Monte Carlo moves for a system at a single temper- 
ature, and such single-system moves are performed between each attempted swap. To satisfy detailed 
balance, the swap moves must be performed with a certain probability, although performing the swaps 
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A = min{l, exp +(# - ^)(f/(rf ) - U(vf )) } . 



(2) 
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after a fixed number of single-temperature Monte Carlo moves satisfies the sufficient condition of bal- 
ance. 6 A typical sequence of swaps and single-temperature Monte Carlo moves is shown in Figure 2. 



Kofke conducted an analysis of the average acceptance rate, (A), of exchange trials and argued that 
this quantity should be related to the entropy difference between phases. 7 9 For systems assumed to 
have Gaussian energy distributions, typical of many systems that are studied using computer simula- 
tion, see Figure 3, he found the average acceptance ratio, (A), to be given by 



(A) = erfc 



2 1 



1/2 1 - Pj/Pi 



;i + ova) 2 ) 1/2 



(3) 



where C v is the heat capacity at constant volume, which is assumed to be constant in the temperature 
range between /3j and (3j. Simply put, the acceptance rate for the trials depends on the likelihood 
that the system sampling the higher temperature happens to be in a region of phase space that is 
important at the lower temperature. This theoretical analysis of the acceptance rates becomes useful 
when considering the optimal choice of temperatures for a parallel tempering simulation (see Section 
2.3). 



2.2 Theory of Molecular Dynamics Parallel Tempering. 

In Monte Carlo implementations of parallel tempering, we need only consider the positions of the 
particles in the simulation. In molecular dynamics, we must also take into account the momenta of all 
the particles in the system. Sugita and Okamoto proposed a parallel tempering molecular dynamics 
method in which after an exchange, the new momenta for replica i, , should be determined as 




pW= i^L p W } ( 4 ) 



where pW are the old momenta for replica i, and T \ d and T new are the temperatures of the replica 
before and after the swap, respectively. 5 This procedure ensures the average kinetic energy remains 
equal to ^Nk B T. The acceptance criterion for an exchange remains the same as for the MC imple- 
mentation (Eqn.El) and satisfies detailed balance. 



When doing parallel tempering molecular dynamics, one must take care in the interpretation of the 
results. A parallel tempering exchange is an 'unphysical' move, and so one cannot draw conclusions 
about dynamics. That is, when using parallel tempering molecular dynamics, one is only really doing 
a form of sampling and not 'true' molecular dynamics. 



2.3 Optimal Choice of Temperatures. 

How one chooses both the number of replicas employed in a parallel tempering simulation and the 
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temperatures of the replicas are questions of great importance. One wishes to achieve the best pos- 
sible sampling with the minimum amount of computational effort. The highest temperature must be 
sufficiently high so as to ensure that no replicas become trapped in local energy minima, while the 
number of replicas used must be large enough to ensure swapping occurs between all adjacent repli- 
cas. Several suggestions for the number and temperature of the replicas have been offered. It is clear 
from Figure 3 and Eqn.|2]that the energy histograms must overlap for swaps to be accepted. Sugita et 
al. and Kofke have proposed that the acceptance probability could be made uniform across all of the 
different replicas, in an attempt to ensure that each replica spends the same amount of simulation time 
at each temperature. 5 ' 7 8 Kofke showed that a geometric progression of temperatures (|f = constant) 
for systems in which C v is constant across the temperatures results in equal acceptance ratios. San- 
bonmatsu et al. suggested that a target acceptance ratio, Aarget, can be obtained iteratively by solving 

^target = exp[A/3A£] , (5) 

where AE is the difference in the average energies of the systems in adjacent temperatures. 10 Itera- 
tive methods for adjusting the temperatures of the different systems to ensure that acceptance ratios 
stay within acceptable bounds had previously been proposed and utilized by Falcioni 11 and Schug et 
al. 12 in adaptive temperature control schemes. Rathore et al. 13 extended these approaches to sug- 
gest a scheme for the optimal allocation of temperatures to replicas that is also based on iteratively 
altering system temperatures. In their scheme, the lowest temperature is fixed, and the other system 
temperatures are determined by iteratively solving 



AE 



AE 



a 



(6) 

target 



for each of the temperatures , Tj , where a m = [a (Tj) +a (Tj)] /2 is the average deviation of the energies 
in the two systems. One can choose the target value to achieve a desired acceptance ratio. 

Rathore et al. also consider the optimal acceptance ratio and number of replicas in parallel tempering 
simulations. 13 For the case studies used in their work, they found that an acceptance ratio of 20% 
yielded the best possible performance. That is, adding more replicas once the high and low tem- 
peratures are fixed and the acceptance ratio of 20% had been achieved resulted in no increase in the 
performance of the simulation. 

Recently, Kone and Kofke have provided an analysis of the selection of temperature intervals in sys- 
tems where C v is assumed to be piecewise constant across each temperature interval. 14 They argue 
that although this may not always be the case, the assumption is reasonable and does not require an 
iterative scheme that can consume valuable CPU time and which violates detailed balance. Their 
analysis is based on maximising the mean square displacement, a 2 , of a system as it performs the 
random walk over temperatures. The value of a 2 is proportional to the number of accepted swaps and 
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(ln(Pj/ Pi)) 2 . By maximizing a 2 with respect to the acceptance probability, they found that an ac- 
ceptance probability of 23% is optimal. This value is strikingly similar to the empirically determined 
20% of Rathore et al. Kone and Kofke suggest "tuning" the temperature intervals to achieve the 23% 
acceptance probability during the initial equilibration of a simulation. This approach appears to be an 
efficient method to select temperature intervals in parallel tempering simulations that mix efficiently. 

A similar scheme for choosing the temperatures has recently been proposed by Katzgraber et al, 
which uses an adaptive feedback-optimized algorithm to minimize round-trip times between the low- 
est and highest temperatures. 15 This approach more directly characterizes the mixing between the high 
and low temperature systems. In complex cases, where there are subtle bottlenecks in the probability 
of exchange of configurations, the round-trip time is likely to better characterize the overall efficiency 
of parallel tempering than is the average acceptance probability. The approach of Katzgraber et al. is 
a promising one for such complex cases. 

A related issue is how much simulation effort should be expended on each replica. For example, it 
would seem that the low temperature replicas would benefit from additional simulation effort, as the 
correlation times at lower temperature are longer. This issue is untouched in the literature. 

Since the width of the energy histograms increases as \/~N, but the average energy increases as N, 
the number of replicas increases as \/~N, where TV is is the system size. 4 One, therefore, would like a 
method where only part of the configurational degrees of freedom are exchanged. Interestingly, this 
issue was solved in Swendsen and Wang's 1986 paper for spin systems, 1 but it has not been solved in 
an exact, efficient way for atomistic systems. The main difficulty seems to be in defining a piece of a 
system that can be exchanged without the penalty of a large surface energy. 

2.4 Parallel Tempering with Alternative Parameters and Sampling Methods. 

The general idea of parallel tempering is not limited to exchanges or swaps between systems at differ- 
ent temperatures. Investigators have developed a number of methods based on swapping alternative 
parameters in order to minimize barriers that inhibit correct sampling. Additionally, parallel temper- 
ing can be combined with a large number of alternative sampling methods, and its use has led to a 
great improvement in the sampling of many existing computational methods. 

Fukunishi et al. developed a Hamiltonian parallel tempering method that they applied to biomolecular 
systems. 16 In this approach, only part of the interaction energy between particles is scaled between the 
different replicas. In their work, they conduct case studies using two different implementations of their 
approach. In the first they scale hydrophobic interactions between replicas. In the second, they scale 
the van der Waals interactions between replicas by introducing a cut-off in the interaction, effectively 
allowing chains of atoms to pass through each other. The acceptance probability in Hamiltonian 
parallel tempering for a swap between replicas i and j is given by 

A = min{l, exp [-0 {[H^X') + H 3 (X)} - [H^X) + H 3 {X')])}} , (7) 
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where H^X) is the Hamiltonian of configuration X in replica i, and configurations X and X' are the 
configurations in replicas i and j, respectively, prior to the swap attempt. 

Parallel tempering using multiple swapping variables was first proposed and developed by Yan and 
de Pablo. 1718 Instead of considering a one-dimensional array of replicas at different temperatures, 
they suggested using an n-dimensional array, where each dimension represented a parameter that 
varied between replicas. Swaps both within and between dimensions were allowed in their scheme. 
In their first work they conducted parallel tempering between different temperatures and chemical 
potentials in the grand canonical ensemble, but the scheme they proposed was general. They showed 
that extensions of parallel tempering to multiple dimensions are limited only by the imagination of 
the investigator in choosing the variables to swap and the available computational resources. Sugita 
et al. utilized multdimensional exchanges in molecular dynamics studies. 19 

de Pablo and co-workers also implemented parallel tempering in the multicanonical ensemble. 20 In 
the multicanonical ensemble, the probability distribution is no longer Boltzmann, but becomes 

p(r N ) = (const)e- /3C/(riV) w(r Ar ) (8) 

The weight factors, w(r N ), are chosen so as to lower the barriers in the system, de Pablo and co- 
workers derived multicanonical weights by an iterative process using a Boltzmann inversion of his- 
tograms. Another way to write Eq. ® is to use instead of the Hamiltonian U, the weighted Hamilto- 
nian U + £(U) when attempting swap moves, where £(U) is an umbrella potential. By using a multi- 
canonical ensemble, de Pablo and co-workers were able to reduce the number of replicas required in 
their simulation, due to a broader overlap of thermodynamic-property histograms. In general, when 
combined with a multicanonical simulation, a short parallel tempering run can be performed, and 
the multicanonical weight factors can be determined by using histogram reweighting. These weights 
can then be used in the multicanonical part of the calculation. 21 Parallel tempering can be combined 
with a multicanonical simulation. That is, in the multicanonical simulation, a number of replicas, 
each in the multicanonical ensemble but each with different multicanonical weight factors covering 
different energy ranges, may be employed. 21 It should be noted that far fewer replicas are needed in 
this method than in typical parallel tempering because the energy ranges covered in a multicanonical 
simulation are far wider than in a canonical simulation. The weight factors utilized in these methods 
may then be iteratively improved during the equilibration period as the simulation proceeds, using 
histogram reweighting techniques. 

In free energy perturbation calculations, a parameter A is introduced. One wishes to compute the free 
energy difference to go from an initial (A = 0) state and a final (A = 1) state. For parallel tempering 
with free energy perturbation one can consider M replicas, each with a different A parameter, where 
each replica has a slightly different Hamiltonian 

U x = U x=0 + X(U X=1 - U x=0 ) . (9) 
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Swaps may be attempted between replicas using the Hamiltonian acceptance criterion (Eq.|7]), and the 
free energy difference between two lambda parameters can be determined as in regular free energy 
calculations. Of course, one may utilize a number of different temperature replicas for each value 
of A in a multidimensional approach. Use of parallel tempering in multicanonical simulations, free 
energy calculations, and umbrella sampling is growing. 19 25 

One of the most fruitful combinations of parallel tempering with existing sampling techniques has 
been with density of states methods based on Wang-Landau sampling. 26 Density of states methods 
are similar to multicanonical ones in that the weight factor is the reciprocal of the density of states. 
However, in density of states methods a random walk in energy space is conducted, and a running 
estimate of the inverse of the density of states as a function of the energy is performed. Alternatively 
the configurational temperature is collected as a function of the energy and the density of states deter- 
mined by integrating the inverse temperature. 27 Other sets of conjugate variables can also profitably 
be used. 28 These methods effectively circumvent the tedious and time consuming process of calculat- 
ing weight factors in multicanonical simulations, de Pablo and co-workers have proposed extended 
ensemble density of states methods where overlapping windows or replicas of different energy or 
reaction/transition coordinate values are utilized. 29 Configurational swaps between windows are at- 
tempted at regular intervals to prevent the simulations in the parallel replicas from becoming stuck in 
non-representative regions of phase space. A combination of density of states methods and parallel 
tempering has successfully been used to study protein folding 30 32 and solid-liquid equilibria. 33 

Vlugt and Smit applied parallel tempering to the transition path sampling method. 34 They showed 
that parallel tempering conducted between different temperatures and between different regions along 
transition paths is able to overcome the problem of multiple saddle points on a free energy surface. 
Parallel tempering transition path sampling can provide for more accurate estimates of transition rates 
between stable states than single-temperature Monte Carlo transition path sampling. 

Parallel tempering has been combined with a number of other computational methods, and in almost 
all cases its use has resulted in better sampling and an increase in the accuracy of the computational 
method. Prominent examples include parallel tempering with cavity bias to study the phase diagram 
of Lennard- Jones fluids, 35 with analytical rebridging for the simulation of cyclic peptides, 36 and with 
the wormhole algorithm to explore the phase behavior of random copolymer melts. 37 

Very recently an extension to parallel tempering, known as Virtual-Move Parallel Tempering, has 
been proposed by Coluzza and Frenkel. 38 In their scheme they include information about all possible 
parallel tempering moves between all replicas in the system, rather than just between adjacent replicas, 
when accumulating statistical averages. This approach is essentially a parallel tempering version of 
the "waste recyling" Monte Carlo method of Frenkel 39 and has been shown to improve statistical 
averaging by up to a factor of 20. 
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2.5 Non-Boltzmann Distributions. 

Since their introduction in the late 1980s, Tsallis statistics have become increasingly important in 
statistical mechanics. 40 Due to their power-law, rather than Boltzmann, properties, Tsallis statistics 
generally lead to smaller energy barriers. Therefore, optimization with Tsallis, rather than Boltzmann, 
statistics can be very useful in energy minimization problems. Whitfield et al. have developed a 
version of the parallel tempering algorithm that is based upon Tsallis statistics. 41 This method has 
been used, for example, for fast conformational searches of peptide molecules. 42 



3 Applications 

3.1 Polymers. 

Simulations of polymeric systems are notoriously difficult due to chain tangling, the high density 
of the systems of interest, and the large system sizes required to accurately model high molecular 
weight species. The first application of parallel tempering to polymeric systems was by Yan and de 
Pablo to high molecular weight species. 18 Bunker and Dunweg 43 were the first to utilize excluded vol- 
ume parallel tempering, where different replicas have different core potentials. They studied polymer 
melts for polymer chain lengths ranging from 60 to 200 monomers. Their method created a thermo- 
dynamic path from the full excluded volume system to an ideal gas of random walks and increased 
the efficiency of all their simulations. Bedrov and Smith 44 studied fully atomistic polymer melts of 
1,4-polybutadiene at a range of temperatures, performing parallel tempering swaps isobarically. They 
showed that their parallel tempering approach provided a substantial improvement in equilibration 
and sampling of conformational phase space when compared to regular MD simulations. See Fig- 
ure 4. Theodorou and co-workers studied cis-1,4 polyisoprene melts using parallel tempering and 
once again found that use of parallel tempering resulted in far quicker equilibration over a range of 
temperatures. 45 More recently, Banaszak et al. have utilized hyperparallel tempering in an osmotic 
ensemble to study the solubility of ethylene in low-density polyethylene 46 Using their novel method 
they were able to examine the effect of both polyethylene chain length and branching on the solubility 
of ethylene. 

3.2 Proteins. 

Biological systems, particularly proteins, are computationally challenging because they have partic- 
ularly rugged energy landscapes that are difficult for regular Monte Carlo and molecular dynamics 
techniques to traverse. Hansmann was the first to apply parallel tempering to biological molecules in a 
Monte Carlo based study of the simple 7-amino acid Met-enkephalin peptide. 3 Hansmann showed that 
parallel tempering based simulations could overcome the "simulation slowdown" problem and were 
more efficient than regular canonical Monte Carlo simulations. The application of parallel tempering 
to biological problems, however, did not take-off until Sugita and Okamoto's work that introduced the 
use of molecular dynamics parallel tempering. 5 They applied their approach to Met-enkephalin and 
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demonstrated that their parallel tempering based method did not get trapped in local energy minima, 
unlike regular microcanonical molecular dynamics simulations of the same molecule. 

Following demonstration of the power of parallel tempering for molecular systems, its use in the bio- 
logical simulation community rapidly expanded. Parallel tempering has been used to determine fold- 
ing free energy contour maps for a number of proteins, revealing details about folding mechanisms 
and intermediate state structures 47 51 and has facilitated the simulation of membrane proteins. 52 55 
Parallel tempering has proved to be particularly powerful when applied to NMR structure refinement 
and in the interpretation of data from NMR, 56 60 circular dichroism, 61 IR spectra, 62 and electric deflec- 
tion data 63 of proteins and peptides. For models of globular proteins and oligomeric peptides, parallel 
tempering has been used to study previously unexplored regions of phase diagrams and to sample ag- 
gregate transitions. 64 ' 65 In the study of sucrose solutions near the glass transition temperature, parallel 
tempering simulations showed a much better fit to experimental data than did conventional NPT MC 
results. 66 Other interesting work using parallel tempering includes studies of the thermodynamics of 
fibril formation using an intermediate resolution protein model 67 and of the hypervariable regions of 
an antibody domain where the chief interactions governing conformational equilibria in these systems 
were determined. 68 

With this increased sampling ability of parallel tempering has come the realization that current force 
fields for biological simulation are lacking in some respects. Parallel tempering simulations of sol- 
vated biological molecules have also revealed deficiencies in popular implicit solvent models. 51 ' 69 ' 70 
As parallel tempering can also be used with explicit solvent models, the differences between the treat- 
ments can be determined, and in the future such simulations could be used to improve implicit solvent 
models. 

Brooks and co-workers have developed a multiscale modeling toolkit that can interface with the pop- 
ular CHARMM and AMBER molecular simulation codes. 71 Parallel tempering is implemented in the 
toolkit to allow enhanced sampling and is used to study the ab initio folding of peptides from first 
principles. Parallel tempering has clearly become the method of choice in ab initio protein folding as 
evidenced by the the work of Skolnick and co-workers, 72 Garcia and Sanbonmatsu, 10 ' 73 ' 74 and Yang 
et al. 15 

When examining the biological and chemical literature of parallel tempering, it is apparent that the 
vast majority of work is based on molecular dynamics, rather than Monte Carlo. As one is not doing 
'true' MD when using parallel tempering, there is no reason why Monte Carlo methodologies cannot 
be implemented more frequently in the biological and chemical communities. Indeed, we expect this 
to be a promising avenue for future research. 

3.3 Solid state. 

Crystal structure solution provided one of the first mainstream atomistic simulation examples of the 
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power of parallel tempering. Falcioni and Deem used parallel tempering in a biased MC scheme to 
determine the structures of zeolites from powder diffraction data. 76 For complicated zeolite structures 
containing more than eight unique tetrahedral atoms, simulated annealing is unable to solve the crystal 
structures. However, parallel tempering simulations were shown to be able to solve the structures of 
all complex zeolites, including the most complicated zeolite structure, ZSM-5, which contains twelve 
unique tetrahedral atoms. Zefsall has since been successfully used to solve the structures of at least 
a dozen newly synthesized zeolites and is freely downloadable on the web. A similar approach to 
crystal structure determination from powder diffraction data has been implemented by Favre-Nicolin 
et ah, 11 and this method has been successful in solving several structures. 78-80 

A seminal simulation study of the rate of crystal nucleation by Auer and Frenkel utilized the parallel 
tempering method by allowing swaps between 'windows' at different points along the reaction co- 
ordinate from the liquid to solid state. 81 This work introduced, for the first time, the ability to calculate 
nucleation rates from first principles. 

Other examples of solid-state parallel tempering simulations include the computation of sodium ion 
distributions in zeolites, 82 studying the finite temperature behavior of C 60 clusters, 83 the simulation of 
Si surfaces, 84 85 and the explanation of the titration behavior of MbCO over a range of pH values. 86 

3.4 Spin glass. 

Spin glasses have provided a severe test of the effectiveness of parallel tempering. 87 In the Parisi 
solution of the infinite range Edwards-Anderson model, widely believed by many but not all physicists 
to apply to finite-range spin glasses as well, there is a finite energy for excitations above the ground 
state, and the boundary of these excitations has a space-filling structure. Initial simulations for the 
Edwards-Anderson model confirmed the finite excitation energy. 88 Initial suggestions for a fractal 
surface 88 were ruled out by later simulations. 89 For the vector spin glass model, the excitation energy 
was again found to be finite. 90 Initial suggestions of a fractal surface were also largely ruled out in 
later simulations. 91 

3.5 Quantum. 

Quantum level systems, whilst being far more computationally demanding than classical systems, 
may benefit from the improved sampling provided by parallel tempering. So far, the main application 
of parallel tempering at the quantum level has been in studies of phase transitions and in the location 
of energy minima in complex systems. Parallel tempering is ideal for these studies, as dynamics 
are not of interest. Okamoto and co-workers conducted parallel tempering based ab initio correlated 
electronic structure calculations. 92 In their studies of Li clusters, they demonstrated that parallel 
tempering could be successfully applied to systems described with a high level of detail. Sengupta 
et al. combined quantum Monte Carlo with parallel tempering to study the phase diagram of a 1-D 
Hubbard model. 93 Quantum parallel tempering was found to significantly reduce "sticking" effects, 
where the simulation gets stuck in the incorrect phase close to the phase boundary. 
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Shin et al. have studied quantum phase transitions of water clusters, 94 where the rotational modes can 
be highly quantum. Parallel tempering allowed for efficient conformational sampling. They remark 
that "combining Car-Parrinello approach with replica exchange [parallel tempering] and path integral 
molecular dynamics can provide an ideal methodology for studying quantum behavior of clusters." 
Although the suggested approach is highly computationally expensive, it may become increasingly 
feasible in future years. Parallel tempering has also been successfully employed in a study of the 
finite temperature optical spectroscopy of CaAr n clusters 95 and in quantum path integral simulations 
of the solid-liquid phase diagrams of Nei 3 and (para-H 2 )i3 clusters. 96 

3.6 General Optimization Problems. 

Parallel tempering has been successfully used in a number of general optimization problems. Habeck 
et al. developed a sampling algorithm for the exploration of probability densities that arise in Bayesian 
data analysis. 97 Their approach utilized Tsallis statistics, and the effectiveness of parallel tempering 
was demonstrated by interpreting experimental NMR data for a folded protein. In image analysis, 
parallel tempering has been shown to lead to an improvement by a factor of two for both success rate 
and mean position error when compared to simulated annealing approaches. 98 Parallel tempering has 
also been utilized to locate the global minima of complex and rugged potential energy surfaces that 
arise in atomistic models of receptor-ligand docking 99 and in risk analysis. 100 



4 Conclusion 

In this review we have given an overview of the history of parallel tempering. We have described 
the basic theory and many of the extensions to the original method. Several examples in a variety 
of physiochemical arenas have been discussed. Highlighted technical aspects to sort out include best 
allocations to cluster computers, 101 determination of the optimal amount of simulation effort to expend 
on each replica, and partial swapping of partial configuration information for atomistic systems. 

A number of potential new areas for application of parallel tempering occur to us. One rather large 
one is the application of parallel tempering, rather than simulated annealing, 102 to X-ray single-crystal 
structure solution. A related issue is the prediction of polymorphs for crystals of small, organic drug 
molecules. Also related is use of parallel tempering in rational drug design — most current approaches 
use grid searching, traditional Monte Carlo, or at best simulated annealing. 103 Another physical appli- 
cation where enhanced sampling might be of use is in field theories for polymeric systems with non- 
trivial phase structure. 104 Also possible would be the complementary inclusion in ab initio molecular 
dynamics, if sampling only is desired. Even experimental applications could be possible in materials 
discovery 105 or laboratory protein evolution. 106 
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Figure 1 : 2-D representation of phase space. A simulation at lower temperatures can become trapped 
in a non-representative sample of the low free energy minima (shaded regions). At higher tempera- 
tures, a simulation can sample more of phase space (light plus shaded areas). Configuration swaps 
between the lower and higher temperature systems allow the lower temperature systems to escape 
from one region of phase space where they were effectively 'stuck' and to sample a representative set 
of the low free energy minima. 
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Figure 2: Schematic representation of parallel tempering swaps between adjacent replicas at different 
temperatures. In between the swaps, several constant-temperature Monte Carlo moves are performed. 
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Figure 4: End-to-end correlation function for polymeric 1,4-polybutadiene. The parallel tempering 
simulation relaxes much more quickly, and is, thus, a more efficient simulation. Used with permis- 
sion. 44 Copyright 2001, American Institute of Physics. 
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